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Introduction 

The AAA ATPase (ATPases Associated with var- 
ious Activities: Erdmann el al., 1991) are a large 
family of ATPases involved in a wide array of cel- 
lular processes and occuring in all organisms. 
Twenty-one members of this family are present in 
the genome of the yeast Saccharamyccs cerevisiae 
alone. These include SecJ8, which is part of the 
machinery that fuses transport vesicles to their tar- 
get membranes (Eakle et aL, 1988; Wilson el al., 
1989), Cdc48, which mediates homotypic mem- 
brane fusion during the cisternal regrowth of Golgi. 
stacks and ER membranes (Rabouille et al., 1995; 
I^tterich et al, 1995), YtalO-12, Ymel and 26 S pro* 
tea some subunits, which are involved in protein 
degradation (Arlt et al. 1996; Leonhard et at., 1996; 
Lupas et aL, 1995), and Pasl, which is essential for 
peroxisome biogenesis {Erdmann et al., 1991). The 
hallmark of these proteins is a highly conserved 
domain of approximately 250 residues, which con- 
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tains the Walker A and B motifs (Walker et al., 
1982) typical of P-loop ATPases as well as a second 
region of high sequence conservation, C -terminal 
to the ATP-binding motifs (Erdmann ci ah, 1991; 
Tornoyasu et aL. 1993). Proteins of the AAA family 
may contain one or two ATPase domains. The 
function and mechanism of the AAA domain is 
still unknown; its association with proteases in the 
26 S proteasome and in FtsH homoJogucs indicates 
an involvement tn the disassembly and /or unfold- 
ing of proteins. Sequence comparisons show that 
the AAA proteins belong to a large superf amity of 
ATPases (Lupas et aL, 1997b), many of which are 
also involved in proteolysis and in the disassembly 
of protein complexes (Langer & Neupert, 1996; 
Schirmer el al., 1996; Cottesman et al, 1997). These 
include the Op ATPases, Lon, and the heliease 
subunit RuvB. 

During the characterization of two operons of 
20 S proteasome subunits in the nocardioform acti- 
nomycetc Rliodococcus erythropolis strain NI86/21 
(Tamura et aL, 1995), wc "discovered a gene encod- 
ing a member of the AAA family approximately 
1.5 kb 5' of the second proteasome operon. Here 
we describe the cloning, sequencing and 
expression of this gene. The purified protein has a 
low, A^ethylmaleimide (NHM)-sensitive, Mg 2 '- 
dependent ATPase activity and forms homo-hex- 
americ rings. Because of this property we named it 
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ARQ for AAA ATPase forming Ring-shapod Com- 
plexes. ARC is a divergent member of the AAA 
ATPases, containing two insertions in the con- 
served domain, which are not observed in other 
members of the family. Unlike most AAA proteins 
from eubacteria, ARC is not a FtsH homologue; it 
branches off close to the root of the AAA fauuly 
dendrogram and is not clearly assignable to any of 
its main branches. 



Results 

Gene organization 

The DNA sequences of the two unlinked oper- 
ons, prcB t A, and prcB 2 A 2 , encoding the suhuruts 
of the 20 S proteasome of K. erythropoUs were 
determined bv Tamura et at. (1995). In both DNA 
regions, an open reading frame {or/6 1 and orfS 2f 
respectively, with the same orientation as the pro- 
teasome genes) was found 5' of the proteasome 
operon. The same gene organization was identified 
by genomic semrencing in Mycobacterium leprae 
{Lupas et at, 1997a) and, more recently, also in 
Mycobacterium tuberculosis, two other nocardioform 
actinomycetes. Further upstream, the two Rhodo- 
coccus DNA regions are quite different: the 5' 
region of or/6, is occupied by the the operon, 
encoding the cytochrome P-450 system required 
for thkKrarbamate degradation (Nagy & al, 1995), 
whereas further sequence analysis now has shown 
that the 5' region of orf6 2 is more similar to the 
equivalent DNA region in the mycobacteria, m the 
three species, a similar large open reading frame is 
present in the same orientation as the proteasome 
genes. However, the distances between these 
ORFs, encoding putative ATPases, and the down- 
stream orfS genes is considerably smaller in Rhodo- 
coccus (about 1.5 kb) and Al tuberculosis (1.V kb) as 
compared to M. leprae (3.8 kb). There is no obvious 
sequence similarity between the intergenic region 
of Rhodococcus and Mycobacterium (data not 
shown). The putative start codon of the arc gene in 
Rhodococcus is preceded by a ribosome binding site 
(GGGAGGAGA) at a distance of 5 bp. The gene is 
immediately followed by a potential stem-loop 
structure, which may be involved in transcription 
terrnination (stem length of 22 with three unpaired 



bases). A single RNA band of about 1£ kb was 
visualized by Northern hybridization using an 
ATPase-specific probe (data not shown), consistent 
with a monocistronic mRNA terminated at the 
downstream hairpin structure. 

Sequence analysis 

The arc gene encodes a protein of 593 residues, 
which is 81% identical to the putative ATPases of 
M. leprae (GenBank U00017) and M. tuberculosis 
(GenBank Z97559) identified by genomic sequen- 
cing. From Alan to Gln77, it is predicted to con- 
tain a coiled-coil domain interrupted by Pro34 
(Figure 1). In the mycobacterial homologies 
(which share 92% identical residues), this proline is 
embedded in an insertion of nine residues. The 
part of the coiled-coil following ProM has the same 
length and location within the sequence as the 
coiled-coil predicted in the 26 S proteasome 
ATPases; this region <Ctu35 to Gln77) as well as 
the immediately adjacent sequence up to Glul47 
can be aligned to the proteasome ATPases using a 
Gibbs sampling procedure (Lawrence et al.. 1993), 
but the significance of the scores is not sufficient to 
allow the assignment of this ATPase to the subfam- 
ily of proteasome ATPases. Although ARC clearly 
belongs to the AAA family of ATPases, it has no 
detectable sequence similarity to other AAA pro- 
teins outside the AAA domain. 

The AAA domain of ARC extends from Val228 
to He533 and contains all the sequence blocks typi- 
cal for this family, as well as all the residues 
known or believed to be important for catalysis 
(Figure 2). It has a larger size due to two insertions 
of 15 and 33 residues (Ala293 to Ala307 and 
Val464 to Asn496, respectively), which are not seen 
in other AAA ATPases. In a distance-based den- 
drogram, it branches off close to the root of the 
AAA family and is not clearly assignable to any of 
its main branches (Figure 3). Sequence similarity is 
highest around the Walker A and B motifs, which 
are characteristic of the mononucleotide^ inding 
fold (also known as the P-loop NTPase domain). 
This fold consists of a doubly wound, five- 
stranded, parallel (J-sheet {for a structural classifi- 
cation of P-loop NTPases see the SCOP database at 
http://scop.mrc-imb.cam.acuk/scop). 
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Figure 1. Domain structure of 
ARC. The AAA domain (Val228 lo 
IW533) is marked by an open bar. 
The graph shows the probability of 
coiled-coil formation calculated 
with a 21 residue scanning win- 
dow. Below the graph, the regions 
containing the predicted coiled-coil 
segments in Rliodococcus and M]fcv- 
bacterium ARC are aligned, and the 
residues forming the hydrophobic 
core of the heptad repeat are 
labeled a and d. 
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A consensus secondary structure prediction 
was performed in order to locate the structural 
elements of the nucleotMe-binding fold within 
the conserved ATPase domain of the AAA 
family. This prediction represents the average of 
predictions by the method of Rost & Sander 



(1992), obtained separately for eight groups of 
AAA sequences: Rhodococcus and Mycobacterium 
ARC 26 S proteasome ATPases, Cdc48 homol- 
ogues (domain I), Cdc48 homologues (domain 2), 
ATPases involved in peroxisome biogenesis 
(domain 2), Seclo/NSF (domain 1), FfeH homol- 
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Figure 3. Distance-based dendrogram of the ATPase domains from proteins of the AAA family. 



ogues, and Meil homologues. For the ATPases 
involved in peroxisomal biogenesis and Seel 8/ 
NSF, which have two A'fPase domains, only one 
domain was included as the other is poorly 
conserved. These groups were selected on the 
basis of the dendrogram in Figure 3. Because of 
the large number of sequences considered and 
the wide range of pairwisc sequence identities,, 
the expected accuracy of the final consensus pre- 
diction is above 80% per residue. Jhe canonical 
fold of P-loop NTPases was easily recognized in 
a central region containing five predicted p/a 
elements, which are numbered in Figure 2. This 
secondary structure assignment is validated by 
the known location of the Walker A and B motifs 
at the C-terminai ends of strands pi and p3, 
respectively, and also by the amphipathic charac- 
ter of strands 02 and ffe, which are predicted to 
be the outer strands of the central p-sheet and 
thus partly solvent-exposed. The central nucleo- 
tide-binding fold is flanked on either side by pre- 
dicted a-helices; we refer to these regions as the 
helical extensions. Given that the N and C termi- 
ni of the nucleotidc-binding fold are in close 



proximity, it appears plausible that the helical 
extensions could form a common subdomain. 

Heterologous expression in £. colt 
and purification 

In order to obtain the enzyme in large quantities, 
we amplified the arc gene by PGR with primers 
encoding a C-terrninally fused His 6 -tag, and cloned 
the product into a phage T7 promoter-based 
expression vector. Induction with TPTG led to the 
production of a polypeptide of approximately 
66kDa as determined by SDS-PAGE (the calcu- 
lated molecular mass of ARC is 66.1 kDa), with a 
yield of about 80 g per liter of cell culture This 
polypeptide appeared only in cells carrying the 
recombinant plasmid (data not shown). Basal 
expression at a low level was detected in unin- 
duced cells, possibly as a result of read-through 
from the T7 promoter. Because of the tendency of 
the protein to form inclusion bodies, the amount of 
soluble protein was increased by growing the cells 
at room temperature and inducing the expression 
with a reduced IPTG concentration. 
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The protein was purified by affinity chromatog- 
raphy on a Ni-NTA resin, from which it eluted in a 
single peak with approximately 80% purity. Frac- 
tionation by size exclusion chromatography on a 
Supcrosc 6 column enhanced this purity to 95% 
(Figure 4(a)) and gave a yield of 20 to 30 mg per 
liter of cell culture. The N-tcrrninal sequence was 
determined by Fdman degradation and agreed 
with the deduced sequence of ARC but lacked the 
ammo-terminal methionine. On the Supexose 6 col- 
umn, the protein behaved as a Large oligomer of 



kDa 1 2 3 




c/) 

3 8 kDa 



760 




(b) 



about 900 kDa. Tn a hme-dependont manner, a 
minor second band appeared in the molecular 
mass range above 2 MDa (Figure 4(c)), indicating 
further aggregation of the oligomers. In native 
PAGE, the protein ran at about 769 kDa 
(Figure 4(b)), comparable to the 20 S proteasome of 
R. erythropdlis N186/21, indicating that under these 
conditions, the protein forms a dodecamer. The oli- 
gomeric ARC complexes were dissociated in 30% 
(w/v) urea, yielding one main dissociation product 
with the apparent molecular mass of an ARC 
dimer, but could not be reassembled after dialysis, 
whether or not the buffer contained ATP (data not 
shown). 

In parallel to these experiments, a construct was 
generated carrying an N-terminally fused His 6 -tag. 
In this form, the recombinant ARC protein could 
not be purified by affinity chromatography on a 
Ni-NTA resin, indicating that the N tc*rminus of 
the protein is not accessible (at least in the oligo- 
meric form). 

Constitutive expression of the native 
ATPase protein 

Polyclonal antibodies were raised against recom- 
binant ARC in order to estimate its expression rate 
in vh*o. To this end, the whole water-soluble cell 
extract of R. erylhropoltr. N186/21 was analyzed by 
two-dimensional gel electrophoresis, under dena- 
turing conditions in the pH range of 4 to 6 (the cal- 
culated pi of ARC is 4.9 and the experimentally 
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Figure 4. Gel elect rophorests of recombinant ARC com- 
plex stained with Coomassie blue after purificaHon 
using different chromatographic steps, (a) Tnane SD5- 
10% PAGE show a dominant protein band of 66 kDa. 
Lane 1, mcmbranc-froc cell extract of expression host of 
Bl^21 after 1PTG induction; lane 2, pooled ARC contain- 
ing fractions from Ni-NTA column; lane 3, Superose 6 
main fraction, (b) Pooled high molecular mass fraction 
of Superose 6 was loaded on a native 5% to 15 « gradi- 
ent PAGE according to Laemmli (1970) at pH 8-8- The 
running position of the ATPase relative to the 20 S pm- 
teasome is indicated: (c) Typical Superose 6 chromato- 
gram after running pooled Ni-NTA fractions containing 
ARC complexes. The main peak containing the ATPase 
elutes in a region between two internal standard pro- 
teins of molecular mass, 1060 kDa and 760 kDa for the 
thermosome from T. acicdcphUum and the 20 h protea- 
some from fL erythropolis, respectively. 



(a) 66 




fPG 



Ficure 5. Two-dimensional electrophoresis of the crude 
cell extract of K ttytkrvpol* from N186/12. Pivte.n 
samples (200 ug) in the first dimension were run on 
pH 4 to 6 IPG strips followed by SDS-PAGE m a 10% 
eel for running in Uic second dimension. After transfer 
Snto nitrocellulose membranes, proteins were visualized 
by either polyclonal antiserum agairtst recx>mbmanl 
ARC protein (a) or by Coomassie blue fining (b). Hie 
position of the ARC protein spots as judged hy immu- 
noblot analysis, is indicated with a box. 
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Figure 6. Plot of initial velocity versus ATP 
concentration. 



determined pi of the recombinant protein is 5J2). 
CJels were run in parallel blotted onto membranes, 
and either stained with Coomassie blue or incu- 
bated with the polyclonal antiserum. The location 
of native ARC in the two-dimensional pattern was 
determined by comparison of the two blots 
(Figure 5). ARC is expressed ccmstitutively in a 
relatively small amount of less than 0.1 % of the 
total protein under normal growth conditions; this 
Ls significantly lower than the expression level of 
the 20 S pro tea some in the same organism 
(Tamura ct id.. 1995). Growth of cells at the maxi- 
mum (34"C) and minimum (4°C) growth tempera- 
ture did not cause a visible change of the 
expression rate- On tmmunostained membranes, 
ARC yielded a series of spots of the same molecu- 
lar mass but of different pi values (Figure 5(a)), 
suggesting that the native protein may be multiply 
modified in vivo. 

Nucleotide binding and hydrolysis 

Recombinant ARC is purified as a stable oligo- 
meric complex m the absence of ATP. The protein 
has a Mg^-depcndent ATPase activity, whose kin- 



Table 2. Effect of salt inhibitor and nucleotide on the 
ATPase activity of recombinant ARC 



Addition 


Concern tratkm (mM) 


Activity (%) 


ConJrol 
MgCU 


10 


100 


Salt 
UDTA 

oa a 


11) 
10 


0 
0 


Inhibitor 
NEM 

ADP 


1(1 
3 
1 


5 
100 
100 


Nucleotide (instead of ATP) 

ADP ] 
GTP 1 

crp i 

UTP 1 


5 
0 
50 
0 



Nucleotide hvdnilyztiig activity is expressed as a percentage of 
the cuutroi reaction, which contained 50 mM Tris-HQ (pH 7.4), 
10 mM MgCk. 100 mM NaCl 1 mM ATT. 



etic parameters arc comparable to those of other 
AAA protein (Figure 6 and Table 1). Preincubation 
with the sulfhydryl-blocking agent N-ethylmalei- 
rnide (NEM, 10 mM) resulted in an almost com- 
plete inhibition of the ATPase activity, whereas the 
ATPase inhibitor sodium azide (NaN^ 3 mM) had 
no effect. NEM sensitivity appears to be a general 
property of AAA ATPases, which is surprising 
given that no cysteine residue is highly conserved 
in this family. The pH optimum of ARC lies 
between pH 7 and S and drops off rapidly outside 
this range. 

Nucleotide hydrolysis was also tested with ADP, 
GTP, CTP, and UTP (Table 2). OTP was cleaved at 
haLf the rate of ATP hydrolysis, but GTP or UTP 
were not substrates. ADP was cleaved at a low 
(5%) but detectable rate. No inhibition of ATPase 
activity by the reaction product ADP could be 
observed at equimolar concentration. This is in 
contrast to the observed inhibition of the AAA 
ATPase Cdc48, which is large even at substoichio- 
metric amounts of ADP (Frohlich ct at., 1995). 



Table 1. Kinetic parameters of ATP hydrolysis by members of the AAA family 



Protein 



Organism 



AKC 

VAT 

CDC48 

p97 

NSH 



VTS4p 
Yhs4p 
SUG1 

26 S proteatfome 
RC 



RiiatloctKXus trythniptdis 
Tftennapiasma acidophilum 
Sacdiaroniifces cerevtsiae 
Xcwjws Utevi$ 
Chinese haoelcr ovary 

(high affinity site) , 

(low affinity site) 
SacrJianmryasi unruhiiur. 
Sacdmtwtnfces cereviske 
Rat liver 

Rabbit reticulocytes 
Rabbit reticulocytes 





(pmol/min 


200 


268 


n.d. 


90 


550 


1560 


n.d. 


720 


650 


67 


110 


6 


43,300 


257 


n-d. 


450 


5 


7 


35 


7 


15 


IV 


30 


34 



Reference 



Ihis article 
Famnani et at. (1997) 
Frohlich et al. (1995) 
Peters ct a). (1992) 
Tagava et oi. (1993) 
Morgan et ni. (1994) 
Morgan t.i <ti. (1994) 
Babst et ah (1997) 
Luceiotf nJ. (1995) 
Makino ct al. (1997) 
Hoffman & Rcchsteinor (1996) 
Hoffman & Kechsteiher (1996) 



n.d., not determined 
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with a single dominant peak at 368{±52) kDa 
(n = 1661) with some tailing towards masses up to 
three times mis value (Figure 9(b)). On extrapol- 
ation to "zero dose" the measured mass becomes 
384(±54) kDa, which is close to six times the mol- 
ecular mass of tlie 66 kDa ARC monomer. In a 
control experiment, the ARC preparation was 
stabilized by glutaraldchyde crc^iiruong before 
adsorption- The mass histogram again displayed a 
single dominant peak, in this case at 415(d=55) kDa 
(n = 2897) after extrapolation to zero dose (data 
not shown). A small population of particles had 
masses at approximately twice this value, 
826(±96) kDa (n = 90). The 8% mass increase com- 
pared to the first result may be attributed to glutar- 
aldehyde (MOller et al., 1992). In gel filtration and 
in native PAGE we consistently found a higher 
molecular mass of approximately 760 kDa, indicat- 
ing that under these conditions the ARC rings 
associate into pairs; a similar phenomenon has 
been observed with some other members of the 
AAA family (Hanson et aL, 1997; Pamnani el aL, 
1997; Peters et at., 1992) and with proteasomal 
cc-rings (Zwickl et aL, 1994). 

Discussion 

Topology and active-site residues 

ARC undoubtedly belongs to the AAA ATPases - 
and is therefore built on a F-loop NTFase core, 
consisting of a doubly wound, five-stranded, paral- 
lel (Vsheet Among the NTPases of known struc- 
ture containing this fold are nucleotide kinases, G 
proteins, myosin, kinesm, KecA, DEAD-box heli- 
cases, and F, ATPase. The nucleotide is bound at 
the C-tcrroinal end of the central p-sheet in a cleft 
formed by loops connecting the central strands pi 
and (S3 to helices on opposite sides of the sheet. 




Structural analysis 

The purified ARC protein was examined by 
negative stain electron microscopy (TEM) and by 
scanning transmission electron microscopy (STEM) 
of unstaincd/freeze-dried samples to obtain 
quanurive data regarding its mass. In both uranyl 
acetate and ammonium molybdate-stained prep- 
arations, the ring-shaped ARC protein is observed 
almost exclusively in an end-on orientation 
(Figure 7). The diameter of the complex is 10 to 
11 run, almost the same as the diameter of the 20 S 
proteasome. Image averaging clearly reveals 6-fold 
symmetry; the six subunits enclose a central heav- 
ily stain-filled opening, or cavity, traversing the 
complex (Figure 7(b)). 

Some preliminary experiments undertaken were 
aimed at making two-dimensional crystals suitable 
for electron crystallography. We had some success 
with a lipid monolayer strategy using non-specific 
lipid mixtures from bovine brain extract, while 
experiments with Ni 2 " 1 NTA-cheiator lipids (Schmitt 
et aL, 1994) designed. to interact with the His^-tag 
of ARC have not yet yielded useful crystals. 
Although coherent two-dimensional crystalline 
patches were rather small (Figure 8(a)) some image 
processing was. possible. The resulting averages 
(Figure 8(b)) were in good agreement with the 
averages obtained from single particles but not 
superior to them. The power spectrum showed 
spots corresponding to a resolution of 1.92 nm; the 
resolution of the single particle average is 2.0 nm 
according to the radial correlation function cri- 
terion (Saxton & Baumeister, 1982). 

STEM images of unstained freeze-dried ARC 
preparations showed a relatively non-homo- 
geneous particle size distribution due to aggrega- 
tion effects (Figure 9(a)). Nevertheless, evaluation 
of a data set of 1834 particles yielded a histogram 
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The calculated diameter of the particle is around 10.3 nm. 



These loops are the most highly conserved part of 
the fold: their sequences are known as the Walker 
A and B motifs (Walker et aL, 1982), and contain 
the residues essential for coordinating the phos- 
phate croups of the nucleotide and the magnesium 
kin In most.IMoop NTPases, the Walker A and B 
motifs are adjacent, but in RecA, F, ATPase, and 
UEAD-box hclicases, they are separated by strand 
B4, resulting in a different topology of the central 
p-sheet, although active-site residues occupy equiv- 
alent positions. It has been proposed tfiat AAA 
proteins have a RecA-like topology and that by 
analogy to RecA and ATPase, they use a gluta- 
mate at the C-terminal end of p2 for the nucleophi- 
lic attack (Yoshida & Amano, 1995). We note that 
this residue is not invariant in AAA proteins and 
poorly conserved in Clp ATPases and Lon (Lupas 
el al 1997b). In DEAO-box helicases (which also 
have' a RecA-like topology) the glutamate of the 
DEAD motif at the C-termmai end of 03 inmates 
the nucieophilic attack (Subramanya el aL, 1996), 
and AAA, Clp, and Lon ATPases all contain a 
DExx motif after p3, suggesting that here also this 
glutamate is the nucieophilic residue. 

in manv P-loop NTPases, a residue C-terminal to 
P4 has been proposed to sense the presence of the 
v-phosphate and transmit conformational changes 



resulting from nucleotide hydrolysis. This residue 
is Cinl94 in RecA (Story & Steitz, 1992), Cln254 in 
the DExx-box hclicase PrcA (Subramanya et al, 
' 1996), and Lysl72 in 6-phosphofruclo-2-kinase/ 
fructose-2,6~bisphosphatase (Hasemann et aL, 
1996). In AAA proteins, two residues are likely- 
candidates for this function: an asparagine at the C 
terminus of p4 (Asn398 in ARC"), wlxose location 
corresponds to that of RecA Glnl94 and PrcA 
Gln254, and an arginine at the N terminus of p5 
(Arg412 in ARC), which is conserved in all AAA, 
Clp and lx>n proteins (Lupas et aL. 1997b). 

Is ARC a proteasome-like ATPase? 

The Rhodococats proteasomc, like 20 S pro tea- 
somes from other sources, archaebacterial or 
eukaryotic, can only degrade unfolded polypep- 
tides (Wenzcl & Baumeister, 1995). Therefore, it 
requires the cooperation of an energy-dependent 
svslern binding the target proteins and presenting 
them in an unfolded form to the proteolytic 20 S 
core complex. Although there is conceptually no 
need for the formation of a stable protease-ATPasc 
complex, a transient interaction seems sufficient, 
most known self-compartmentalizing proteases in 
fact form such complexes. While the evolutionary 
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figure 9. Mass determination of the ARC protein by 
means of STEM, (a) Elastic annular dark-field STEM 
images of purified recombinant AKC particles, (b) Histo- 
gram of particle/mass distribution of ARC shown with 
a filled Gauss curve. 



background of the protease components in such 
complexes is quite different, the ATPases are all 
P-loop NTPascs with characteristic common 
sequence motifs beyond the Walker A and B motifs 
(Lupas et ah, 1997a). In eukaryotic proteasomcs the 
ATPases are part of the 19 S cap complex which 
associates with the 20 S core to form the 26 S pro- 
teasomes. In addition to the six different but 
related ATPase subunits, the 19 S cap complex con- 
tains at least ten other subunits, some of which 
provide the link to the ubiquitin system. The pro- 
teasomal ATPases form a distinct subgroup of Hie 
AAA ATPase family (Figure 3). In archaebacteria a 
homologue of this subgroup has been identified 
(Bulttf <3/.,1996). 

The occurrence of the arc gene in the vicinity of 
the proteasome operons in the actinomyccbcs Rho 
dococcus and Mycobacterium, and its absence from 
other eubacteria whose genome has been comple- 
tely sequenced and which do not contain protea- 
somcs raises the possibility that ARC functions in 
the context of the proteasome. Although the sig- 



nificance of blocks of sequence similarity to protea- 
some ATPases is too low to unambiguously 
classify it as a proteasomal ATPase, ARC has a 
simi la r topology, consisting of an N-terminal 
coiled-coti region, a central domain predicted to 
consist mainly of fl-strands, and C-tcrminal AAA 
domain. This topology is typical for proteasome 
ATPases. It is interesting to note that recombinant 
ARC assembles spontaneously into a ring structure 
with almost exactly the dimensions of tlx? 20 S pro- 
teasome. The two complexes differ, however, in 
symmetry, which is 7-fold in the case of the Riiodo- 
coccus proteasome (Ztthl el al, 1997) and 6-fold in 
the case of ARC- Such a mismatch begins to 
emerge as a common principle in protease- ATPase 
complexes as was noted for tlui ClpAP complex 
(Flanagan ct aK 1995; Kessel et al, 1995) and might 
also apply to Hsl VU; in the latter case the isolated 
HsfU complex shows some structural polyn>orprt- 
ism, i.e. complexes built from six and seven sub- 
units coexist (Rohrwild ct at., 1997). The functional 
significance of such a mismatch is still a matter for 
speculation. 

So tar we have not yet succeeded in reconstitut- 
ing in vitro a functional ATPase-dependent system 
capable of degrading folded substrate proteins 
from purified Rkodococciis 20 S pro tan somes and 
ARC. Ihus, it is reasonable to assume that the 
association constant for the two protein complexes 
might be too low for detecting specific interactions 
outside the cell (EJlis, 1997). We further cannot rule 
out that additional factors arc required tor a suc- 
cessful reconsHtution; the product of the highly 
conserved or/7 gene, which in Rhodococcus and 
Mycobacterium is part of the proteasome gene oper- 
on (Tamuxa ct /?/., 1995; Knipfer & Shrader, 1997), 
is a possible candidate. 

Materials and Methods 

The nucleotide sequence of the arc gerie was sub- 
mitted to the CtenBank with accession number 
AF028326. 

All chemicals were of reagent grade or better and pur- 
chased as indicated. Solutions wen? prepared using 
MiHiO water (MilKpore). 

Sequence analysis of ARC from A erythropotis 
NI86/21 

Further characterization of the UNA region upstream 
of the proteasome structural genes prc^ and prcA^ pre- 
viously cloned in /. EMBL3 (clone FAJ2029: Tamura ei ul., 
1995)/ was carried out by DNA sequencing of overlap- 
ping subcloned fragments with an automated sequencer 
(A.L.F., Pharmacia Biotech). The sequence analysis of the 
region downstream of these proteasome genes has been 
reported (Nagy et aL, 1997b). Subsequent assembly was 
performed with the PCCIENE software package (Intelli- 
Genetics). For identification of potential coding regions, 
the GCWIND program (Shields et al. 1992) was used. 

Sequence similarity searches were pcrfonnod using 
the BLAST WWW-page at NCBI (http://www. 
ncbLnlm/nih.giov/cgi-bin/ BLAST). The coiled<roii pre- 
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diction in Figure 1 was obtained from the COILS server 
at htlp://ukec3.unil.ch/software/COILS^form.html. 
The alignment in Figure 2 was generated in MACAW 
(Schuier et aL, 1991). The dendrogram in Figure 3 was 
computed for the AAA ATPase domains only by the 
DARWIN E-mail server (cbrg@ixif.eth/~ch). For the con- 
sensus secondary structure predictions, proteins from 
individual branches of the AAA family were aligned in 
IHicUp (Genetics Computer Group, Madison, W% USA), 
and the alignments were submitted to the PHD 
server (htip:// wovw.embl-heidelberg.de/predictprotetn/ 
phd_pred.html). After alignment of the entire family in 
MACAW/ the predictions were averaged to yield the 
final prediction shown in Figure 2. 

RNA isolation and Northern hybridization 

Total RNA isolation from R. erythropolh NI86/21 and 
Northern analysis were carried out as described (Nagy 
ct al. f 1997a). The riboprobc for Northern hybridization 
was prepared using pPAJ2355 linearized with Ecll3611 as 
a template. Tlus vector consists of pBlucScipt SK-t- with 
the internal 468 bp B$M-PuvU fragment of the ATPase 
gene cloned in the BamHl-Sntal sites. 

Cloning and overexpression experiments 

For the enrichment of the ATPase gene by PCR, two 
sets of His^-tag fused oligonucleotides based on the 
genomic N and C- terminal sequence were synthesized. 
The 5' primers containing an Ndel site and the 3' primers 
containing an EcoRl site had the following sequence 5'- 
AT ATA TAT CAT A PC AGC I CG ACA GAG AAC 
CCG GAT TCG-3' (5' AT ATA TAT CAT ATG CAT CAC 
CAT CAC CAT CAC ATG AGC TCG ACA GAG AAC 
CCG GAT T-30 and 5'-TG GAA TPC CTA CTA CAG 
GTA CTG CCC CGT GTT CGA CTC-? (5'-TG CAA 
TTX: CTA CTA GIG ATG GFG ATG ClG ATG CAG 
GTA CIG GCC CGT GIT CGA CTC-30. PCR was per- 
formed with these primers and plasmid DNA 
(pFAJ2496) containing the complete ATPase gene using 
cloned Pfii polymerase (Stratagene) in a 25 ul reaction 
and cycled 25 times at 56°C annealing temperature. The 
PCR product was electrophoresed in 1.0% (w/v) agarose 
and the 1.8 kb band was excised from the gel and puri- 
fied using the Qiaex DNA extraction kit (Qiagen). The 
el u bed PCR fragment was cleaved with Ndel and Eco RI 
(Boebriuger), ligated into dephosphorylated NdcI/rajRI 
cut pT7-7 vector and transformed into host strain E. coli 
XLl-blue. Positive clones containing the ATPase gene 
were scquci>ccd and transformed into the expression 
strain E. coli Bb-21 (Dfc3). 

For screening for overexpression, small-scale cultures 
of selected clones were grown at 37'C in 3 ml of Luxia 
broth (LB), containing ampicillin (100 jig/mi), to an Agm 
of 0.7, induced with 2.0 mM isopropyl-l-thio-fJ-O-galac- 
topyranoside (IPTG). and grown for a further four hours. 
The cells were harvested and the HtSr.-tagged protein 
purification was performed under native or denaturing 
conditions as described (QlAcxpressionist, Qiagen). The 
level of induction was monitored using SOS- 10% poly- 
acryiamide gel electrophoresis. Under optimized con- 
ditions, the expression was performed in oiw litre of LB 
medium containing 100 ug/ml ampiciUin inoccuiated 
1:200 with the overnight culture and grown at room tem- 
perature until the cell density reached an A^ of 1.5, 
before 0.5 mM 1PTG was added- The culture was then 
allowed to grow for a further two hours. 



Purification of recombinant ARC protein Irom £T. coti 

The cells were harvested by ccntrifugatioo at 5000 % 
and Ivsed by sonicalkm of a 20% (w/v) cell suspension 
in buffer A (50 mM sodium phosphate (pH 7.4), 300 mM 
NaCI, 15 mM 0-mercaptoethanol) after incubation on ice 
for 30 minutes with 1 mg/ml lysozyme. After removal 
of the cell debris by cenbrifugation at 20,000 g. the super- 
natant was loaded with a flow rate of 15 cm /hour on a 
4 ml Ni-NTA column, previously equilibrated with buf- 
fer A. The column was waslied with the same buffer at 
pH 6.0 and bound proteins were eluted with a linear 
gradient of 0 to 03 mM imidazo! in buffer A (10 vol/ml 
resin). Fractions containing the ARC were identified by 
SDS-10% PAGE. The semi-purified HLs^-tagged protein 
was further purified on a Superose 6 column (Pharma- 
cia), in buffer B (25 mM Tris-HCl (pH 7.4), 100 mM 
NaCI, 1 mM DTT) after it hod been concentrated to 
15 mg/ml using Centricon concentrators (Amicon). For 
storage, glycerol was added to the protein solution to a 
final concentration of 20% (w/v). 



ATPase assays 

Assays were routinely performed in 1-5 ml polyprop- 
ylene microcentrifuge tubes for 30 minutes at 37 C C. 
ATPase activity was determined by assaying the release 
of inorganic phosphate using the spectrophotometric 
method of Lanzetta et al (1979) with meliorations. 
A tvpical ATPase assay mixture (50 ul) contained 1.0 ug 
of freshly prepared ARC protein in buffer C (50 mM 
Tris-HCl (pM 7.4), 100 mM NaCI, 10 mM Mgd 2 ) and 
was started by the addition of 1 mM ATP. The reaction 
was stopped by the addition of 800 id of reaction sol- 
ution and 100 ul of 30% (w/v) citric acid and the absorp- 
tion change was monitored using a^ photometer at a 
wavelength of 640 nm. To test for Mg 2 * dependence the 
divalent cation was either chelated by the addition of 
10 mM EDTA or replaced by 10 mM CaCl 2 in a Mg 2 * - 
free buffer. When the nucleotide hydrolyztng activity 
was tested for other nucleoside phosphates, GXP, CEP, 
UTP and ADP instead of ATP were added at 1 mM 
initial concentration to start the reaction. The ATPase 
activity at various pH values (5.0, 6-0, 70, 8.0, 9.0) was 
determined in standard assay buffer C, where Tris-HCl 
was replaced by Tris-citrate of the appropriate pH. To 
test the sensitivity of the enzyme for ATPase inhibitors/ 
N-cthyui\alcdmide (10 mM) or sodium azidc (3 mM) 
were added to the assay mixture and incubated on ice 
for 15 minutes before the reaction was started. The kin- 
etic parameters, (Michaelis constant) and (maxi- 
mum velocity) were determined within the first ten 
minutes of assay conditions bv measuring the initial vel- 
ocity of ATP hydrolysis in triplicate. From the plot of the 
initial velocity of a simple Michael is-Men ten reaction ver- 
sus the substrate concentration, the corresponding K m 
and V %tWA values were calculated by non-linear regression 
methods (Cleland, 1979). It should be mentioned that the 
highest ATPase activity was always recorded with 
freshly prepared recombinant protein. The hydrolysis 
rate dropped down sharply within a few days when the 
enzyme was stored at 4'C However, the addition of 
20% glycerol had a stabilizing effect, which protected the 
activity almost completely. 

Preparation of antiserum 

Recombinant ARC protein (SOU ug) purified by 
Ni-NTA and Superose 6 chromatography was dialyzed 
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against phosphate-buffered saline and homogenized in 
an equal volume of complete Freund's adjuvant (CFA). 
A New Zealand rabbit was immunized subcutaneoiisly 
on day one followed by a second and third injection at 
days 21 and 42, respectively. Three weeks after the last 
injection whole blood was collected,, allowed to dot for 
60 minutes at 3TC and serum was obtained from the 
clot by removing the insoluble material by centrifu- 
gatian. 

Gel electrophoresis and immunoblotting 

SDS-FACE using 10% gels was performed as 
described (Schagger & von Jagow, 1987), Native gels 
with a linear gradient from 5% to 15% were run as 
described (Zuhl ct nL, 1997). Two-dimensional gel elec- 
trophoresis was performed using the method described 
by Gdrg et at. (1988). Proteins (200 ng) in the first dimen- 
sion were run on immobilized pH gradient (IPG) gel 
strips with a linear gradient of pH 4 to 6 (prepared 
according to the description of the manufacturer; Phar- 
macia) and applied to denaturing SDS-10% PAGE for 
running in the second dimension. Visualization of the 
proteins was done after transfer onto nitrocellulose mem- 
branes according to Towbin et aL (1979) with either Coo- 
massie-blue for total protein or with bromochloroindolyl 
phosphate/nitro blue tetrazolium (BCIP/NBT) for 
immunodetection with alkaline phosphatase. 

Two-dimensional crystallization of ARC 

Two-dimensional crysta Nutation experiments were 
performed using a lipid monolayer technique. A small 
Teflon trough containing 500 ul of an aqueous subphasc 
{10 mM Mes-KOH (pH 5.4), 200 mM Nad, 20 mM 
CdCl^ 5 mM (Nri^SOt, 2% (w/v) glucose) was used. 
Then 10 ul of a solution of 0.25 mg/mi bovine brain 
extract (Sigma) in chk>roform/«-hexane (1:1, v/v) was 
spread onto it and allowed to stand for 15 minutes. The 
ARC protein solution (5 ul, 20 mg/ml) was then injected 
into the subphase, and incubated for 15 minutes to allow 
adsorption of tlic protein onto the lipid monolayer. 1 lie 
monolayer was transfered by placing a grid with a holey 
carbon film onto the surface and lifting it off. Samples 
were negatively stained with uranyi acetate (2% (w/v)) 
for transmission electron microscopy (TEM). 

Transmission electron microscopy (TEM) and 
scanning transmission electron microscopy (STEM) 

Samples from the Superose 6 fractions were brought 
to a concentration of 100 ug/ml and adsorbed onto 
glow-discharged carbon support films. TEM was per- 
formed using a Philips EM 420 at an accelerating voltage 
of 100 kV. Electron micrographs were routinely recorded 
at magnifications from 30,000 x to 46,000*. Single par- 
ticle averaging was performed as described (Baumeister 
et al. t 1988). 

STEM mass measurement was performed as described 
by Engel (1978) to determine the molecular mass of the 
ARC protein. For this, the ATl'asc was diluted with 
25 mM Mops buffer without p-rnercaptoethanol, to a 
concentration of approximately 50 ug/ml. Adsorption 
was to glow-discharged thin carbon films supported by 
a fenestrated thick carbon layer on 200-mesh gold-coated 
copper grids. The grids were then washed four limes on 
droplets of quartz-bidtstUIcd water and freeze dried at 
-80°C in the microscope overnight In a second exper- 



iment the complex was cross-linked, with 0.1% (v/v) 
glutaraldehyde by a 30 minute incubation at room tem- 
perature prior to adsorption. The mass evaluations were 
carried out as described by MQllcr el at. (1992). 
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